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SEQUENCE LISTING 

1 (1) GENERAL INFORMATION: 

2 (i) APPLICANT: LI, SHENGFENG 

3 PHILLIPS, DAVID 

4 (ii) TITLE OF INVENTION: IDENTIFICATION OF Bap-1, A PROTEIN THAT 

5 BINDS TO INTEGRIN AND IS INVOLVED IN INTEGRIN 

6 MEDIATED 

7 SIGNAL TRANSDUCTION 

8 (iii) NUMBER OF SEQUENCES: 5 

9 (iv) CORRESPONDENCE ADDRESS: 

10 (A) ADDRESSEE: MORGAN, LEWIS & BOCKIUS LLP 

11 (B) STREET: 1800 M ST., NW 

12 (C) CITY: WASHINGTON Q"irf!!Q? 

13 (D) STATE : DC fMI feT^ 

14 (E) COUNTRY: USA B 

15 (F) ZIP : 20036 

16 (v) COMPUTER READABLE FORM: 

17 (A) MEDIUM TYPE: Floppy disk 

18 (B) COMPUTER: IBM PC compatible 

19 (C) OPERATING SYSTEM: PC-DOS/MS -DOS 

20 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 

21 (vi) CURRENT APPLICATION DATA: 

C--> 22 (A) APPLICATION NUMBER: US/09/991, 888 

C--> 23 (B) FILING DATE: 26-NOV-2001 

24 (vii) PRIOR APPLICATION DATA: 

25 (A) APPLICATION NUMBER: US/09/590,175 

26 (B) FILING DATE: 09-Jun-2000 

27 (A) APPLICATION NUMBER: US 08/753,038 

28 (B) FILING DATE: 1996-11-18 

29 (A) APPLICATION NUMBER: US 08/972,719 

30 (B) FILING DATE: 1997-11-18 

31 (viii) ATTORNEY/AGENT INFORMATION: 

32 (A) NAME: REID G. ADLER 

33 (B) REGISTRATION NUMBER: 30,988 

34 (C) REFERENCE/DOCKET NUMBER: 44481-5025-02-US 

35 (ix) TELECOMMUNICATION INFORMATION: 

36 (A) TELEPHONE: (202) 467-7000 

37 (B) TELEFAX: (202) 467-7258 

38 (2) INFORMATION FOR SEQ ID NO : 1: 

39 (i) SEQUENCE CHARACTERISTICS: 

40 (A) LENGTH: 3467 base pairs 

41 (B) TYPE: nucleic acid 

42 (C) STRANDEpfJESS : double 
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43 




(D) TOPOLOGY: 


linear 


















44 


(ii) 


MOLECULE TYPE: 


DNA (genomic) 


















45 


(ix) 


FEATURE : 




















46 




(A) NAME/KEY: 


CDS 


















47 




(B) LOCATION: 


226 . .1233 


















48 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO 


: 1: 
















49 


GCCCCTCGCT CGCTCGCTCC TTCCCGCCCT CCCCGCAGCG 


CCGGCCGAGC 


CGGCTTCCCC 


bU 


50 


TCAGTCTCTC ATGAATATTG AGCGGCCCCT GTTGTATTTC 


CCGAGCTCCA 


TTGCGGAAGC 


IzU 


51 


TGAGGCTCGC CATATTGTGC GGCGGCGCCG GCGTCCGCGG 


CAGCTGATAC 


CAGAGTCTTG 


1 on 
lo 0 


52 


CTCCGGCCGC GGCCAGCGGA GCCCTGGGCT GGGGCAGGAG 


CCGCA ATG TCT CAG 


Z 34 


53 














Met Ser Gin 




54 
















1 








55 


GCT 


hth pap. ara aap 


PP A APT PAA A 
obH ntl LM bbA 


1 1 A 


AGC 


AAA 


A A 

ALA 


1 bb 


r* a a 
bAA 


bib 


2 82 


56 


Ala 


\Tp> 1 f2l r\ Thr Acn 
vai ulu l ill noil 


biy inr bin Fro 


Leu 


Ser 


Lys 


m u v 

i nr 


Trp 


bill 


Leu 




57 




5 


1 u 






1j 












58 


apt 

nb 1 


TTA TAT P A P TTA 


LAA LbA ACA LCI 


CAG 


GAG 


GCA 


ATA 
AIA 


A /"* A 

AbA 


GAT 


GGC 


3 30 


59 




-ucu. iyi biu jueu 


bin Arg inr pro 


Gin 


Glu 


Ala 


i le 


l nr 


Asp 


bly 




60 






Z D 




3 (J 








O D 




61 


TTA 


P.AA ATT PTP. PTT 
\Jir\r\ nil bib oil 


TPS. PPT /^T" 1 A A/"""P 

ibA bbl LbA Abl 


CTA 


CAC 


AGT 


GAA 


TTA 


ATG 


TGC 


37 8 


62 


Leu 


Pin T 1 a Va 1 Va 1 
olU lie Val Val 


Ser Pro Arg Ser 


Leu 


His 


Ser 


blU 


Leu 


Met 


Cys 




63 








A C 










D U 




64 


CCA 


ATT TflT TTP. PAT 
nil lol 1 1 b bnl 


ATP TTf" A A r* A A r* 

Alb lib Mb AAb 


Abb 


ATG 


ACT 


TV /"l A 

AbA 


AAG 


GAG 


TGT 


426 


65 


D TT^ 

r -L\J 


lie Lyb ijeu ASp 


Met Leu Lys Asn 


Thr 


Met 


Thr 


Thr 


Lys 


b J_U 


Cys 




66 






b U 










0 _> 






67 


TTA 
1 In. 


PAT PPT TTT TPT 
Inl tbi 111 lb 1 


btA bAb Ibb ATC 


ATC 


ACA 


GCC 


b 1 1 


AbA 


A r* rp 
Ab 1 


bbb 


4 74 


68 


Leu 


nxo niy irllt: byib 


Aia Asp bys lie 


Tl 

1 le 


Thr 


Ala 


Leu 


Arg 


Ser 


bly 




69 




7 n 


/ D 








80 








70 


AAC 


AAA GAA TGT CCT 


ACC TGT CGG AAA 


AAA 


CTA 


GTT 


TCC 


AAA 


AGA 


TCA 


522 


71 


Asn 


Lys Glu Cys Pro 


Thr Cys Arg Lys 


Lys 


Leu 


Val 


Ser 


Lys 


Arg 


Ser 




72 




85 


90 






95 










73 


CTA 


AGG CCA GAC CCA 


AAC TTT GAT GCA 


CTC 


ATC 


AGC 


AAA 


ATT 


TAT 


CCA 


570 


74 


Leu 


Arg Pro Asp Pro 


Asn Phe Asp Ala 


Leu 


He 


Ser 


Lys 


He 


Tyr 


Pro 




75 


100 




105 




110 








115 




76 


AGT 


CGT GAT GAG TAT 


GAA GCT CAT CAA 


GAG 


AGA 


GTA 


TTA 


GCC 


AGG 


ATC 


618 


77 


Ser 


Arg Asp Glu Tyr 


Glu Ala His Gin 


Glu 


Arg 


Val 


Leu 


Ala 


Arg 


He 




78 




120 




125 










130 






79 


AAC 


AAG CAC AAT AAT 


CAG CAA GCA CTC 


AGT 


CAC 


AGC 


ATT 


GAG 


GAA 


GGA 


666 


80 


Asn 


Lys His Asn Asn 


Gin Gin Ala Leu 


Ser 


His 


Ser 


He 


Glu 


Glu 


Gly 




81 




135 


140 










145 






82 


CTG 


AAG ATA CAG GCC 


ATG AAC AGA CTG 


CAG 


CGA 


GGC 


AAG 


AAA 


CAA 


CAG 


714 


83 


Leu 


Lys lie Gin Ala 


Met Asn Arg Leu 


Gin 


Arg 


Gly 


Lys 


Lys 


Gin 


Gin 




84 




150 


155 








160 










85 


ATT 


GAA AAT GGT AGT 


GGA GCA GAA GAT 


AAT 


GGT 


GAC 


AGT 


TCA 


CAC 


TGC 


762 


86 


He 


Glu Asn Gly Ser 


Gly Ala Glu Asp 


Asn 


Gly 


Asp 


Ser 


Ser 


His 


Cys 




87 




165 


170 






175 












88 


AGT 


AAT GCA TCC ACA 


CAT AGC AAT CAG 


GAA 


GCA 


GGC 


CCT 


AGT 


AAC 


AAA 


810 


89 


Ser 


Asn Ala Ser Thr 


His Ser Asn Gin 


Glu 


Ala 


Gly 


Pro 


Ser 


Asn 


Lys 




90 


180 




185 




190 










195 




91 


CGG 


ACC AAA ACA TCT 


GAT GAT TCT GGG 


CTA 


GAG 


CTT 


GAT 


AAT 


AAC 


AAT 


858 



file://C:\Crf3\Outhold\VsrI991888.htm 7/25/02 



Page 3 of 8 



PATENT APPLICATION: US/09/991 , 888 TIME: 14:55:38 

Input Set : N:\Crf3\RULE60\09991888.raw 
Output Set: N:\CRF3\07252002\I991888.raw 

92 Arg Thr Lys Thr Ser Asp Asp Ser Gly Leu Glu Leu Asp Asn Asn Asn 

93 200 205 210 

94 GCA GCA ATG GCA ATT GAT CCA GTA ATG GAT GGT GCT AGT GAA ATT GAA 

95 Ala Ala Met Ala He Asp Pro Val Met Asp Gly Ala Ser Glu He Glu 

96 215 220 225 

97 TTA GTA TTC AGG CCT CAT CCC ACA CTT ATG GAA AAA GAT GAC AGT GCA 

98 Leu Val Phe Arg Pro His Pro Thr Leu Met Glu Lys Asp Asp Ser Ala 

99 230 235 240 

100 CAG ACG AGA TAC ATA AAG ACT TCT GGT AAC GCC ACT GTT GAT CAC TTA 

101 Gin Thr Arg Tyr He Lys Thr Ser Gly Asn Ala Thr Val Asp His Leu 

102 245 250 255 

103 TCC AAG TAT CTG GCT GTG AGG TTA GCT TTA GAA GAA CTT CGA AGC AAA 

104 Ser Lys Tyr Leu Ala Val Arg Leu Ala Leu Glu Glu Leu Arg Ser Lys 

105 260 265 270 275 

106 GGT GAA TCA AAC CAG ATG AAC CTT GAT ACA GCC AGT GAG AAG CAG TAT 

107 Gly Glu Ser Asn Gin Met Asn Leu Asp Thr Ala Ser Glu Lys Gin Tyr 
1° 8 280 285 290 

109 ACC ATT TAT ATA GCA ACA GCC AGT GGC CAG TTC ACT GTA TTA A AT GGC 

110 Thr He Tyr He Ala Thr Ala Ser Gly Gin Phe Thr Val Leu Asn Gly 
HI 295 300 305 

112 TCT TTT TCT TTG GAA TTG GTC AGT GAG AAA TAC TGG AAA GTG AAC AAA 

113 Ser Phe Ser Leu Glu Leu Val Ser Glu Lys Tyr Trp Lys Val Asn Lys 

114 310 315 320 

115 CCC ATG GAA CTT TAT TAC GCA CCT ACA AAG GAG CAC AAA TGAGCCTTTA 

116 Pro Met Glu Leu Tyr Tyr Ala Pro Thr Lys Glu His Lys 
H7 325 330 335 

118 AAAACCAATT CTGAGACTGA ACTTTTTTAT AGCCTATTTC TTTAATATTA AAGATGTACT 

119 GGCATTACTT TTATGGAGAT CTTGGATATG TTGTTCAATT TTCTTTCTGA GCCAGACTAG 

120 TTTACGCTAT TCAAATCTTT TCCCCTTTAT TTAAGATTTC CTTTTTGGAA GGGACTGCAA 

121 TTATTCAGTA TTTTTTTCTT TCCTTTAAAA AAATATATCT GAAGTTTCTT GTGTTTTTTT 

122 TTTTCCCCAC AAAGTGTGTT TCCACTTGGA GCACCATTTT GACCCAGGAA TTTTTCATAG 

123 TTTCTGTATT CTTATAAGAT TCAGTTGGCT GTCCTTTTCC TGCTCCCCTC AAAAGATTTT 

124 TAGTCATACA GAATGTTAAA TATTATGTAT TCTGACTTTT TTTTTCCCCC GGAGTCTTGT 

125 ATATTTATAG TTTTCCTATA TAAACTGTAG TATCTTCATG AAGAACCCAA GGCTCAAATT 

126 TACTGTCCTT AAAAACAATT CTCATAGGAT TATTCTTTTC ATGGTATTTT CTTCCATAAT 

127 ATCTCATTTT AAAAAGAAGT TCTTTATGAA ACTTAGTGTC CATTGTCATG CAATGTTTTT 

128 TTTTTCCATT CTTTTTCCCC TGTAATTTTG GAATTTCTGG TCCTGGGAAG AATCAAACAA 

129 AATCTTAAGT TCTATGAGAA CTTGGTTCAT TGACATATTC TGCTGAAGAA AGAAAAATTA 

130 AATTGGTAGT AAAATATAGT CTTCAAGTAT ACGTTTGAGA GTGCTTTTTT TTGTATTAGT 

131 TCTGCTGTCA CTTCATTTCC TGTATTATAT GTGATGTTTT TCCCCATTAA AATACCAGAG 

132 ATAATGGAGA TATTTTGCAC TTTAGCCTTG ATGAAAAGTA CAAGATATGT TCAAAGCTTC 

133 CCTAATTTTT TTCTTATTTG TAGCCACATA AGTTTCAAGA ATAACATGGC ACACAGAACA 
13 4 ATGGAAAAAA GTTTGTTTCC ATTGGAAAAT TATATCATTT TGGGTTGCCA CATCAGTTTA 
13 5 TAAATTTGGC GCTCTTTTAA TTACACTCTG TAGAAGGTTA ATAGAGCTTG AGCCCTGCTT 
13 6 TAATATGTAG TGAAAGATAA TTCTGTAGAA AAACGTCAGC CAGTAGGGTA AAGTCATTCT 
13 7 ACTGTTCTTA ATTTTTATAT TGAGGAACAA TATTGGGTGT TTGGGAGCCA GAAAGCTTTG 
13 8 TTGACAGATC AGAAATAAGA TTGACTTGGG TGTTATATTT CATCTCTCTC CAGACTCTAG 

13 9 GTATATTTCC AACTTTATAT ATCACAGTAT TTAAAAAGAC ATGTTTGCAT TGAGAAATTA 

14 0 ACCCTAAAGG GTTTTCAATA GGGTGTAGAC CTCCAGTACC TTTGTAACTA AAGTCTGTCT 
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Trp Glu Leu Ser Leu Tyr Glu Leu Gin Arg Thr Pro Gin Glu Ala He 

167 20 25 30 

168 Thr Asp Gly Leu Glu He Val Val Ser Pro Arg Ser Leu His Ser Glu 

169 35 40 45 

170 Leu Met Cys Pro He Cys Leu Asp Met Leu Lys Asn Thr Met Thr Thr 

171 50 55 60 

172 Lys Glu Cys Leu His Arg Phe Cys Ala Asp Cys He He Thr Ala Leu 

173 65 70 75 80 

174 Arg Ser Gly Asn Lys Glu Cys Pro Thr Cys Arg Lys Lys Leu Val Ser 

175 85 90 95 

17 6 Lys Arg Ser Leu Arg Pro Asp Pro Asn Phe Asp Ala Leu He Ser Lys 

177 100 105 110 

178 He Tyr Pro Ser Arg Asp Glu Tyr Glu Ala His Gin Glu Arg Val Leu 
1 ? 9 115 120 125 

Ala Arg He Asn Lys His Asn Asn Gin Gin Ala Leu Ser His Ser He 



180 



181 130 135 



182 

183 145 " 150 '"" "l55 

184 

185 165 
186 



140 



Glu Glu Gly Leu Lys He Gin Ala Met Asn Arg Leu Gin Arg Gly Lys 



160 



Lys Gin Gin He Glu Asn Gly Ser Gly Ala Glu Asp Asn Gly Asp Ser 



170 175 



Ser His Cys Ser Asn Ala Ser Thr His Ser Asn Gin Glu Ala Gly Pro 

187 180 185 190 

188 Ser Asn Lys Arg Thr Lys Thr Ser Asp Asp Ser Gly Leu Glu Leu Asp 

189 195 200 205 

190 Asn Asn Asn Ala Ala Met Ala He Asp Pro Val Met Asp Gly Ala Ser 



205 
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141 AGTCATTGTA AATATTTATC TGTCAGTTTT GACAGATTGG GGCCAGCTTG ATGTTTTAAA 2 6 83 

142 TCTTCAGCCC GGTATGAAAA CTTAAAGGTA TATATTCAAT TTTTTACCAT TTTATGGAAA 2 74 3 

143 ATATTTAAAA TTTGTTTTTA CAGGGTTTTT TTTTTTTTTT TTTTTTTTTT GTAATCTGTG 2803 

144 CCATGAAATT TGAAAACCAC CAAAAATCAA GGGAACTTTT ATATATTCAA TTCCTTTTCT 2 863 
14 5 GGTGTAATGT TAAAGTTGTA TAGATTATTA ATGCATGCCC ACTGAATATA ACCCTGGTTT 2 923 
14 6 TGTGATAAAA CTGCTTAGAT TTTGTTGATG ACATTAGATT AGTAGTTGCA TTAAATAACT 2 983 
147 AAATTCCCAT TGTGATTAAT TGAAATTTTG TCTTTAAGCA GAGAGTTATT TGTGACTATA 3 04 3 
14 8 AGCTTTGTGC TTAGAGAATG TATGTGTTTT TATCTGTCAG TATGGGAGGA TATAAACTGC 3103 

149 ATCATTAGTG AAATTATTGG TTGTGTAATC CTTTGTGAAA TATAATTCTA GGTATTTGAT 3163 

150 AGGGTATTGA GTGTATTTTG TGTGTGTGTG GATGTGTGTT TTGGGGTACG GGGAGAGGCG 3 2 23 

151 ATGCTATTGG CCATCACTAC CAACCAGGGT TTCAAAAAGT ATATACCTAA GTAATTTCTT 3283 

152 TTATCACTAC CTCAACTGAG GAAGAAAAGG CTCACCACAA GTGGTGTGAA GGCTTTGGGT 3343 

153 ACTTAGTTCT AAATTTTTTT ATGGTAACAT ATACATAGCC ACATTTACAG TTTTAACCAT 3403 

154 TTTAAGGCAT GTAATTCAGT GGGGTTAGGT ACATTCACAA TGTTGTGTAA TGATCACCGC 3463 

155 CGTG 

157 (2) INFORMATION FOR SEQ ID NO: 2: 

158 (i) SEQUENCE CHARACTERISTICS: 

159 (A) LENGTH: 336 amino acids 

160 (B) TYPE: amino acid 

161 (D) TOPOLOGY: linear 

162 (ii) MOLECULE TYPE: protein 

163 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

164 Met Ser Gin Ala Val Gin Thr Asn Gly Thr Gin Pro Leu Ser Lys Thr 

165 1 5 
166 



3467 
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191 210 215 220 

III 225 Pr ° ™ r L6U Met Glu L ^ As P 

194 Asp Ser Ala Gin Thr Arg Tyr lie Lys Thr III Gly Asn Ala Thr Val 

245 250 255 



Asp His Leu Ser Lys Tyr Leu Ala Val Arg Leu Ala Leu Glu Glu Leu 



197 260 
198 

199 275 
200 



265 270 



Arg Ser Lys Gly Glu Ser Asn Gin Met Asn Leu Asp Thr Ala Ser Glu 



280 285 



201 ^ 290 ^ ^ ^ Ma Ser Gln Phe ^r Val 

202 Leu Asn Gly Ser Phe Ser Leu Glu Leu Val Ser G^u Lys Tyr Tr P Lys 

310 315 
204 val Asn Lys Pro Met Glu Leu Tyr Tyr Ala Pro Thr Lys Glu His lJs 

325 330 
20 7 (2) INFORMATION FOR SEQ ID NO: 3; 

208 (i) SEQUENCE CHARACTERISTICS: 

209 (A) LENGTH: 328 base pairs 

210 (B) TYPE: nucleic acid 

211 (C) STRANDEDNESS : double 

212 (D) TOPOLOGY: linear 

213 (ii) MOLECULE TYPE: DNA (genomic) 

214 (i X ) FEATURE: 

215 (A) NAME/KEY: CDS 

216 (B) LOCATION: 1. .327 

217 (ix) FEATURE: 

218 (A) NAME/KEY: misc feature 

219 (B) LOCATION: 1...325 

22? i-«s Qh l V °7 ER INF0RMATI0N -' p ^tial mouse sequence; nucleotides 
\\\ L ? u° W Homology to nucleotides 727-1051 of SEQ ID 

222 NO: 1 (human) . 

223 (xi) SEQUENCE DESCRIPTION: SEQ ID NO ■ 3- 

Ser G?v rf GGT GAC CAC TGT AGT AAC ^CA TCC 

Ser Gly Ala Glu Asp Asn Gly Asp Ser Ser His Cys Ser Asn Ala Ser 

~„ 5 10 is 

f ° GCG GG ° CCG AGT AAC AAA CGG ACC AAA ACC 

Thr his Ser Asn Gin Glu Ala Gly Pro Ser Asn Lys Arg Thr Lys Thr 

Ser A^o f C GAT GA " AAC AAC AAT GCA G ^A GTG GCG 

232 P 35 P " Gly ^ ASP LeU ASP Asn Asn Asn Ala Gly Val Ala 

ill Asp S~ ™ A ~" " AC " GT GGC AGT GAG ATT GAG ^ A GTG TTC AGG 
235 50 
236 
237 



224 
225 
226 



228 
229 
230 
231 



n. ^ Pro nl Met Asp 01y Ala ser -~ - - - - £ £ 



55 60 



CCC CAT CCA ACT CTT ATG GAA AAG GAC GAC AGC GCA CAG ACG AGA TAG 

238 65 ^ G1U LYS ASP ASP Ser A1 * G ^ Arg Tyr 

239 ATA AAG ACT TCA GGC AAT GCC ACT GTT GAT dc TTA TCC AAG TAT CxG 

240 He Lys Thr Ser Gly Asn Ala Thr Val Asp His Leu Ser lJs lyl Leu 
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Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 

l?"? 2 T> 6 T re ^ 3 CO " es *> ondi ** explanation is presented Lnl h T<220> 

to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:5; Xaa Pos . 2 , 3 , 5 , 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 20 , 23 , 24 , 26 , 27 , 28 , 29 , 30 , 31 
Seq#:5; Xaa Pos . 32, 33, 34, 35, 36, 38, 39 ' ' ' 1 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/991,888 



DATE: 07/25/2002 
TIME: 14:55:39 



Input Set : N:\Crf3\RULE60\09991888.raw 
Output Set: N:\CRF3\07252002\I991888.raw 



L:22 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER • 1 
L:23 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE • 1 
L:281 M.-341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 after pos -0 
L:283 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 after pos '16 
L:285 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5 after pos -32 
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